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Abstract. Interplanetary Coronal Mass Ejections (ICMEs) are the man¬ 
ifestation of solar transient eruptions, which can significantly modify the plasma 
and magnetic conditions in the heliosphere. They are often preceded by a 
shock, and a magnetic flux rope is detected in situ in a third to half of them. 
The main aim of this study is to obtain the best quantitative shape for the 
flux rope axis and for the shock surface from in situ data obtained during 
spacecraft crossings of these structures. We first compare the orientation of 
the flux ropes axes and shock normals obtained from independent data anal¬ 
yses of the same events, observed in situ at 1 AU from the Sun. Then, we 
carry out an original statistical analysis of axes/shock normals by deriving 

4 Observatoire de Paris, LESIA, UMR 
8109 (CNRS), 92195 Meudon Principal 
Cedex, France 

5 Departamento de Fisica and IFIBA, 

Facultad de Ciencias Exaetas y Naturales, 

Universidad de Buenos Aires, 1428 Buenos 
Aires, Argentina 

6 Space Science Center and Department of 
Physics, University of New Hampshire, 

Morse Hall, 8 College Rcl, Durham, NH, 

03824, USA 


DRAFT 


March 23, 2015, 12:23am 


DRAFT 



JANVIER ET AL.: COMPARING GENERIC MG AXIS & SHOCK SHAPES 


X - 3 


the statistical distributions of their orientations. We fit the observed distri¬ 
butions using the distributions derived from several synthetic models describ¬ 
ing these shapes. We show that the distributions of axis/shock orientations 
are very sensitive to their respective shape. One classical model, used to an¬ 
alyze interplanetary imager data, is incompatible with the in situ data. Two 
other models are introduced, for which the results for axis and shock nor¬ 
mals lead to very similar shapes; the fact that the data for MCs and shocks 
are independent strengthen this result. The model which best fit all the data 
sets has an ellipsoidal shape with similar aspect ratio values for all the data 
sets. These derived shapes for the flux rope axis and shock surface have sev¬ 
eral potential applications. First, these shapes can be used to construct a con¬ 
sistent ICME model. Second, these generic shapes can be used to develop 
a quantitative model to analyze imager data, as well as constraining the out¬ 
put of numerical simulations of ICMEs. Finally, they will have implications 
for space weather forecasting, in particular for forecasting the time arrival 
of ICMEs at the Earth. 
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1. Introduction 

Interplanetary coronal mass ejections (ICMEs) are the interplanetary manifestations of 
coronal mass ejections (CMEs) and are identified by a number of typical properties that 
differ from those of the ambient solar wind [e.g., Gosling, 1990, 2000; Neugebauer and 
Goldstein, 1997; Wimmer-Schweingruber et al, 2006; Zurbuchen and Richardson, 2006; 
Rouillard, 2011]. A region of compressed plasma and magnetic field, called the sheath, is 
typically present at the front of ICMEs. When the leading front is faster than the ambient 
solar wind (above the local fast MHD mode speed), an ICME is preceded by a shock. 

An ICME can be distinguished from the ambient solar wind by analyzing both remote 
and in situ observations, with remote observations being the privileged method to give 
hints on their 3D shape. Nonetheless, coronagraphs and heliospheric imagers still only 
provide 2D images of the denser parts of CMEs through the Thomson scattering of white- 
light by free electrons [ Howard , 2011; Thernisien et al., 2011]. Furthermore, since the 
location of the scattered light cannot unambiguously be determined along the line of 
sight, a shape model is typically needed to analyze the observations. 

Classical shape models are point-like, or are represented by a sphere surrounding or 
attached to the Sun [Lugaz et al., 2010, and references therein]. These models allow 
the estimation of the propagation direction and the speed of ICMEs, especially when 
triangulation with different spacecraft is possible [Liu et al, 2013]. More elaborated 
analytical models have been proposed such as a dense shell located around a flux-rope like 
shape [ Thernisien et al, 2006], parametric models of sheath shapes [Tappin and Howard, 
2009; Howard and Tappin, 2010] and another parametric shape applied to both sheaths 
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and flux ropes was proposed by Wood and Howard [2009]. All these models were fitted 
visually to CMEs observed with coronagraphs and/or heliospheric imagers. However, 
it remains difficult to quantify the quality of the fit for each of these models, and to 
confirm whether the derived shape is indeed representative of the CME 3D shape ( e.g. 
multiple solutions are possible for a given set of CME observations). Finally, while other 
techniques have been tested, as reviewed by Mierla et al. [2010], present multi-spacecraft 
remote observations cannot uniquely determine the 3D shape of the bright front, even 
when they can be tracked and be unambiguously connected to the CME source [ Harrison 
et al, 2008; Mostl et al. , 2009; Lugaz and Roussev, 2011]. 

Magnetic Clouds (MCs) can be distinguished inside a fraction of ICMEs, when they are 
observed in situ. A MC is identified as a structure presenting an enhanced magnetic field 
intensity, a large scale and coherent magnetic field rotation (associated with the passage of 
a large scale flux rope, FR), and low proton temperatures [e.g., Burlaga et al., 1981; Dasso 
et al, 2005]. Theoretical models describing the global shape of MCs in the heliosphere have 
been developed and compared with single spacecraft in situ observations [e.g., Marubashi 
and Lepping, 2007; Hidalgo and Nieves-Chinchilla, 2012], However, reconstructing the 
3D global FR shape from in situ measurements is an ill-posed problem, as there is no 
unique solution. Moreover, the models frequently contain many free parameters, so that 
minimizing the difference between observations and models can sometimes provide several 
compatible solutions. 

Simultaneous multi-spacecraft observations of a given event can provide a better un¬ 
derstanding of the global shape of MCs. Some studies have shown that the directions of 
the local axis of a given MC, at well separated locations from different spacecraft, are 
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consistent with a smooth global axis of the cloud in the heliosphere (see e.g., Burlaga 
et al [1981] and Figure 2 in Ruffenach et al. [2012]) but this is not always the case (e.g., 
Farrugia et al. [2011]). However, this kind of simultaneous observations of the same event 
is not frequent. 

Numerical simulations have also been used to better understand the propagation and 
evolution of MCs in the solar wind [e.g., Riley et al, 2003; Manchester et al., 2004; 
Lugaz and Roussev, 2011]. There are however limitations for emulating some physical 
processes, such as magnetic reconnection, the effects of the turbulence on macroscopic 
structures such as the drag [e.g., Matthaeus and Velli , 2011], or the dynamical evolution 
of structures strongly dominated by the magnetic field (such as magnetic clouds). Some 
ICME propagation properties can be found by combining observations with simulations 
[e.g., Lugaz et al., 2009; Lugaz and Roussev, 2011]. 

Recently, Janvier et al. [2013] analyzed the distribution of MC local orientation from a 
sample of more than 100 events observed at 1 AU by Wind, and derived the mean shape 
of the MC axis with an original statistical method. A comparable technique was also used 
for deriving the shock surface driven by ICMEs from observations of the shock normal in 
a sample of more than 250 events observed at 1 AU by the ACE spacecraft [ Janvier et al, 
2014a], 

In this paper, we apply, extend and combine the techniques previously used [ Janvier 
et al, 2013, 2014a], with the aim of determining the generic shape of the MC axis and the 
shock surface, combining different samples/databases for MCs and shocks. In Section 2, 
we present the samples of MCs and shocks we use in this statistical study, and provide 
a comparison of the results given for the same events analyzed by different authors. We 
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also define the angles used to determine the orientation of the MC axis and normal to the 
shock surface. In Section 3, we first set the techniques to derive the axis and shock shapes 
with the same formalism. Then, we apply them to various data sets and we compare these 
results to three analytical models. We find that one of the models is the closest to all the 
observations analyzed, and we test the robustness of our results. Then we summarize our 
main results in Section 4 and outline the implications of our results in Section 5. 

2. Samples of observed MCs and shocks and definition of the shape parameters 
2.1. Sets of observed MCs, similarities and differences 

In the present study, we use three lists of MCs where the FR parameters are already 
determined by a common method, namely, the data were fitted with the static, linear 
force-free cylindrical model, also known as the Lundquist’s model [see e.g. Lundquist, 
1950; Goldstein, 1983; Lepping et al, 1990]. A detailed description of the model and its 
limitations can be found in Lepping et al. [1990, 2003] and more information on the fitting 
methods used can be found in Lynch et al. [2003] and Feng et al. [2007]. 

Lynch et al. [2005] analyzed 132 MCs observed nearby Earth by Wind and ACE space¬ 
craft during the period 1995-2003. Feng et al. [2010] analyzed 62 MCs preceded by a 
shock and observed by Wind during the period 1995-2007. Finally, we use an extended 
list of events (Table 2 at http://wind.nasa.gov/mfi/mag_cloud_Sl.html) which is based on 
the results of Lepping and Wu [2010] and includes more recent MCs. This list contains 
the parameters obtained for 121 MCs observed by Wind spacecraft during the period 
1995-2009. We remove from Lepping and Wu' s list the MCs that were crossed too close 
from their boundaries, limiting the list to 107 MCs [for coherence with previous study, 
see Janvier et al., 2013]. 
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The above fitting method has several limitations. For example, due to its simplicity, 
the model does not take into account the expansion of MCs, although this could affect the 
fitted model parameters and the derived quantities (e.g. see Tables 3 and 4 in Nakwacki 
et al [2008]). Another important limitation is the assumed circular cross section, and 
as such several non-circular models have been proposed [e.g. Hu and Sonnerup, 2002; 
Vandas et al. , 2005; Hidalgo, 2003]. From a statistical study of the closest approach of 
the spacecraft from the MC axis, using the Lepping and Wu' s list, Demoulin et al. [2013] 
showed that the MC cross section is flatter in the radial direction (from the Sun) by a 
third to a half on average. Indeed, some MCs have a relatively flattened cross section 
[Vandas et al., 2005; Antoniadou et al, 2008; Farrugia et al, 2011], while others have a 
more round cross section, especially at their cores [Hu and Dasgupta , 2005; Liu et al, 
2008; Mostl et al, 2009; Isavnin et al, 2011]. The effects of a non-circular cross section 
on the derived properties of MCs (e.g. their axis direction) remain to be characterized 
for large sets of MCs. So far such lists are only available for the fit with the Lundquist 
model, so with a circular cross section. 

While the three MC lists used in the present study [Lynch et al, 2005; Feng et al, 
2010; Lepping and Wu, 2010] are based on the same fitting method to determine the FR 
parameters, the fit has been applied in different ways to the various MC samples. For 
example, Lynch et al [2005] imposed that the spacecraft closest approach to the FR axis 
is the temporal midpoint of the FR, so that the FR radius is not a parameter of the 
fit (in contrast with other authors), but is instead determined from the FR properties: 
orientation, mean velocity and impact parameter (defined as the closest distance approach 
of the spacecraft to the FR axis divided by the FR radius). Among the differences in the 
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method implementation, the difference in the FR boundary selection is, in our opinion, 
the most important, as it could have large implications on the derived FR orientation [e.g. 
Dasso et al. , 2006; Ruffenach et al, 2012], The boundaries are typically set where abrupt 
changes of the magnetic held and plasma parameters are detected [e.g. Dasso et al., 
2007]. However, if such changes are not co-located, then different criteria can provide 
different boundary locations. As such, it often occurs that one MC can be defined with 
different boundary locations depending on the authors [e.g. Riley et al., 2004; Al-Haddad 
et al, 2013]. 

An important physical process during the evolution of MCs in the solar wind to be con¬ 
sidered, and which strongly affects the determination of their boundaries, is their erosion 
as they travel away from the Sun. As the magnetic held of MCs can reconnect with that 
of the ambient solar wind, a part of the original FR disappears, so that a case-dependent 
fraction of a MC can have a mixture of both MC and solar wind properties. This leads to 
large uncertainties when a FR is actually identified. Thus, a detailed analysis of each MC 
is needed to determine which remaining region of the FR is crossed by the spacecraft, and 
the ht should only be applied to this specihc part of the MC. As an example, minimum 
variance and Lundquist’s ht can result in signihcantly different orientations, without co¬ 
herence along the axis of a MC observed by four spacecraft [ACE, STEREO A and B, 
Wind; Farrugia et al., 2011], while both methods give consistent results when refined time 
intervals (e.g. considering an eroded FR) are used [Ruffenach et al, 2012], 

2.2. Sets of observed shocks 

In the present paper, we study the probability distributions of shock orientation pa¬ 
rameters from different samples. We use lists of shocks studied in Feng et al. [2010] and 
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Wang et al. [2010]. The shocks of Feng et al. [2010] were observed by Wind spacecraft 
during the period 1995-2007 and were all located at the front of a MC sheath. The list 
contains 62 shocks. Shock and shock-like events from Wang et al. [2010] were observed by 
ACE spacecraft during the period 1998-2008. They were investigated with the purpose of 
analyzing their effects on the Earth magnetosphere and ionosphere. In the following, we 
only retain the well-defined shock events, which represent a total of 216 cases (117 shocks 
in front of ICMEs and 99 shocks with no detected ICMEs behind). 

Both of these shock studies are based on a shock fitting procedure using the MHD 
Rankine-Hugoniot relations as developed by Lin et al. [2006]. The model used to deter¬ 
mine the geometry of the shock is based on the one-fluid anisotropic Rankine-Hugoniot 
relations. A Monte-Carlo calculation and a minimization technique between the generated 
models and observations were also similarly used. However, while the basic technique is 
the same in both studies, there are variations affecting the computed shock parameters. 
Firstly, the data are from two different spacecraft, involving different measurements with 
their own specificities. Secondly, the results depend on the selected time intervals, both 
for the upstream and downstream regions of the shocks, which are used to select the 
magnetic field and plasma parameters used in the minimisation technique. Finally, the 
results also depend on the terms included in the Rankine-Hugoniot relations [see the two 
methods studied in Lin et al., 2006]. As a consequence, differences between the results 
of both studies have to be expected when the same shock is analyzed. We analyze these 
differences in Section 2.4. 

2.3. Angles defining the MC axis and shock normal 
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The directions of the FR local axis and the shock normal are obtained from the fitting 
and minimisation procedures for both MCs and shocks, respectively. For in situ observa¬ 
tions at 1 AU, i.e. nearby Earth, unit vectors used to express directions are classically 
defined in the GSE system of reference. Setting a spherical coordinate-system with a 
South-North polar axis, the orientation vector is classically defined by its latitude and 
longitude. However, the polar axis direction is singular, as it corresponds to any values 
of longitude. Therefore, as the absolute value of the latitude becomes larger, any small 
change in the vector orientation leads to a large uncertainty in the longitude determina¬ 
tion. This is a problem when characterizing the orientation of the axis of a MC, which may 
be highly inclined to the ecliptic. However, MC axes are found to rarely lie in the radial 
direction. Thus, Janvier et al. [2013] introduced a new spherical coordinate-system, with 
a polar axis along the Sun-Earth direction (along -x. GS e)- Projecting the unit vector on 
a plane perpendicular to x.qse, they introduced the inclination angle i on the ecliptic. It 
is measured from the West-East direction (y gse) in a clockwise direction (when looking 
toward the Sun, see Figure la). A second angle was defined as the location angle A, which 
measures the angle between the FR axis direction and the ortho-radial. Equivalently, A is 
also the angle between the radial direction and the normal to the FR axis (n axis located 
within the plane i = constant, Figure lb). Then, for both cases (shock normal and MC 
axis), A is defined in the same way, as the angle between -x. G se and n (n s h oc k or n axis ) and 
both A axis and A s h oc k measure the location of the spacecraft crossing if the MC/shock axis 
shape is known. For A axis and A s h oc k values close to zero, the spacecraft crossing is close 
to the apex of each structure, while both A ax i S and A s h oc k increase as the crossing is more 
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on the flank of the encountered structure, therefore defining A as a “location” angle (see 
Section 3). 

In summary, (A ,i) defines a new spherical coordinate system where —x cse is the polar 
axis. Both angles A and i are related with the latitude 6 and the longitude 0 of n axis or 


n s hock by: 


sin A = — cos 0 cos 6 , 

(1) 

tan i = tan 6/ sin 0 . 

(2) 


Then, A and i are both functions of 0 and 6. However, with MC data [ Janvier et al., 
2014b] as well as for shocks (not shown), A is mostly correlated with 0, and i with 9. 

2.4. Precision of the axis and shock normal directions 

The typical level of precision that can be determined for n axis and n shock is investigated 
in more detail below, by comparing the orientation parameters for events that are common 
to different data sets. To do so, we associated events by comparing the times defining the 
boundaries of MCs or shocks. 

The time window for the association of MCs from different lists was set to 10 hours, for 
both front and rear boundaries. We chose a rather large window, as the definition of the 
MC boundaries is dependent on the plasma and magnetic data used, as well as on the 
authors’ selection criteria. However, we also computed an association with a smaller time 
window (e.g. 5 hours), which provided comparable results but with less cases. Extending 
the time window to more than 10 hours, however, leads to a few MCs of one list associated 
to two MCs of another list, and therefore wrong results. 
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For shocks, the time window was set to 1 hour, as shocks are better temporally localized 
than MC boundaries. We also included in this time window the delay between ACE and 
Wind observations. Note that further extending this time window {e.g. 2 hours) only 
provides a few more associations. 

The Lepping and Wu’s list has 45% and 49% of MCs in common with the Lynch’s and 
Feng’s list, respectively, and the Wang’s list has only 14% of shocks in common with the 
Feng’s list (which is limited to shocks associated to MCs). This indicates that we can 
compare the results obtained for the same events, while also having a significant number 
of independent events. 

The relations between A and % for the same events in different pairs of event lists are 
analyzed in Figure 2. We show there the values of |A|, i.e. we have grouped together 
the values of A obtained in the west and east legs of the MCs, increasing the statistics 
(this is also justified by a weak asymmetry between the two MC legs, as found in Janvier 
et al. 2013). Moreover, this allows the direct comparison of the results obtained for the 
MCs with those obtained for the shocks, as the sign of A cannot be determined for the 
latter. For conveniency, we mark in the following and in all graphs A instead of |A|. The 
data are fitted by a linear function (in blue), which provides an easy visual interpretation 
of how close two results are for one same event. We also report two correlation-ranking 
coefficients, the Pearson (c p ) and Spearman (c s ) ones, as well as the standard deviation 
(sd) between the two data sets. As a whole, the location angle A (left panels) between 
two samples is not as well correlated as the inclination angle i (right panels). A also has a 
larger global bias since the linear fit (blue line) is located further away from the identity 
line (in brown) than for the i angle, for which both lines almost coincide. Finally, A has 
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a lower standard deviation than i by a factor 2 to 5, but this is not a striking result 
considering that the range of variation for i (« [—150°, 150°]) is five times larger than the 
range of variation for A (~ [0°,60°]) so that the standard deviation relative to the range 
of variation is broadly comparable for A and i. 

For MCs, the quality of the fit is measured by y, the square root of the chi-square 
function between the measurement of the magnetic field inside the FR and the fitted 
profile. The values of both \ and the impact parameter, p, could a priori affect the 
precision of n axis . In fact, we found no significant improvement in the correlation-ranking 
coefficients, nor in the standard deviation values, when we analyzed sets of data containing 
only lower \ values. Next, by taking the common cases in Lepping and Wu’s and Feng’s 
lists of MCs, we found that both the correlation coefficients and the standard deviation 
value are not significantly affected by \p\. However, if Lynch’s MCs are associated with 
either Lepping and Wu’s or Feng’s MCs, the correlation coefficients significantly increase 
as the maximum value of \p\ decreases from 1 to 0.5. Thus, the results of Lynch et al. 
[2005] depend more on the magnitude of the estimated \p\ values than for the other two 
lists. 

The large dispersion in the values of A found by different authors, especially for its lower 
values, is not straightforward to interpret. Indeed, for low values of A (he. near the apex), 
one would expect that shocks, and similarly MCs, have a better determined normal as the 
spacecraft crosses its structure orthogonally. These expectations appear to be not true 
considering the results found in Figure 2. A possible candidate to account for such an 
effect may be the determination of the flux rope boundaries. Thus, the location of the 
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MC/shock crossing (as quantified by A) is not an important parameter in determining the 
precision of the normal/axis direction. 

In conclusion, the large dispersion of A and % angles between different MC lists is con¬ 
sistent with the results of Al-Haddad et al. [2013] who tested the results of a larger variety 
of methods but on a more limited sample of MCs. This large dispersion, both for the MC 
axis and shock normal directions, does not allow the comparison of these directions for 
individual MCs [as attempted by Feng et al., 2010]. However, we will show in Section 3 
that the distributions of A provide robust information on the MC axis and the shock 
shapes. 

2.5. Probability distributions of the orientation parameters 

Analyses of the inclination angle i for the 107 MCs detected by Wind [ Lepping and 
Wu, 2010] and for the 216 shocks detected by ACE [Wang et al, 2010] were performed by 
Janvier et al. [2013, 2014a, respectively]. Taking into account the statistical fluctuations of 
limited samples, no global tendency was found for i (its probability distribution appeared 
to be uniform for both MC and shocks). We confirm this isotropy of inclination in the 
present paper by analyzing the histograms of i for the MCs analyzed by Lynch et al. 
[2005] and Feng et al. [2010] and for the shocks analyzed by Feng et al. [2010], as shown 
in Figure 3 where no global tendency is present over the statistical fluctuations both for 
the full sets (blue histograms) as well as with restricted sets defined with a restricted 
range of the impact parameter (pink and light brown histograms) as described at the end 
of this section. We selected 20 bins in the histograms as a compromise between statistical 
fluctuations in each bin and a visualization of the variations with i. 
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We interpret this isotropy of inclination as follows. The MCs and the majority of the 
shocks observed at 1 AU are generated by CMEs launched from the Sun. During a time 
period of several years, as covered by the data sets, the Sun launched CMEs from a large 
number of unrelated source regions with diverse orientations. If there is no preferential 
orientation in the sources, a spacecraft is expected to cross the MCs and shocks associated 
with the CMEs with an approximative uniform probability of i. As such, the above results 
indicate that there is no privileged direction of n axis and n shock around the Sun-apex line 
(within the limits of the statistical fluctuations). 

In contrast, the probability distributions of A (Figure 4), for both MCs and shocks, are 
largely non-uniform, even when considering statistical fluctuations (which are typically of 
the order of ^/n~ c where n c is the number of cases in an histogram bin). All the MC axis 
distributions are globally decreasing functions of A (except for the narrow peak at A ~ 30° 
for Lynch et al ., however this peak may not be statistically significant). Then the axis 
distributions are different than the shock distributions, which all peak around A ~ 30° 
(Figure 4). Even though subgroups of events lead to larger statistical fluctuations, the 
shape of the distributions remains almost the same. For example, similar histograms are 
found when considering subgroups of MCs with different fitting qualities [see e.g. Figure 5 
in Janvier et al, 2013], or when considering subgroups of shocks corresponding to different 
categories of ICMEs [see e.g. Figure 5 in Janvier et al, 2014a, and Figure 4 for shocks 
associated to MCs]. 

The quality of the estimation of the FR axis direction decreases with the absolute value 
of the impact parameter \p\ [e.g. Riley et al, 2004], which is related to the distance of 
the spacecraft trajectory to the FR axis. We investigated the effect of \p\ on the above 
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distributions, and found no significant effect for both parameters i and A: the histograms 
with \p\ < 0.7 (pink) and \p\ < 0.5 (light brown) have a similar dependence with i 
and A than when all cases are considered (blue) as shown in Figures 3 and 4. This is 
because large \p\ values are present in all the values taken by i and A, so that the general 
tendencies remain the same. The sample that has the largest relative number of cases with 
\p\ > 0.5 is that of Lynch et al. [2005], with 45% of the cases (Figure 4, top middle panel). 
We anticipate that the conclusion derived from the general properties of the orientation 
parameters distributions will be weakly affected by the (non-)inclusion of the cases with 
large |p| values. 

3. Best synthetic models to fit in situ data 

In this section, we explore the properties of the shapes given by different synthetic mod¬ 
els, for both the axis of MCs and the shock fronts. These properties are directly compared 
with those given by in situ data, allowing us to select the best models to reproduce the 
mean shape of both MC axis and shocks. To do so, the probability distributions of the 
parameters describing these synthetic models are least-sqnare fitted to the distribution of 
the observed MC axis and shock normal. 

3.1. Generic equations for axisymmetric models 

We first start with the description of the basic and general equations that we use to 
describe the models. These equations were previously introduced for the MC axis and 
shock front [ Janvier et al ., 2013, 2014a] using spherical coordinates (p, <F, ip) centered on 
the Sun (S). For a given point on the shock surface, <f> is the angle defining the position 
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around the Sun-apex line, while for the MC axis, <f> is constant since the axis is supposed 

to be contained in a plane for a given event. 

For the shock front, we suppose in the present work a symmetry of rotation, so that the 
shock surfaces are all independent of <f>. This simplifies the expression for p, the distance 
between the Sun (S) and the spacecraft crossing point (M). Then, the vector from the 
Sun to the point M can be written as a function of p as (Figure lb): 

SM = p(p) u p . (3) 


The location angle A is defined as the angle between the radial direction from the Sun 
(along Up) and the local normal (n) to the axis/shock shape. A is related to p(p) as: 


. n • fL, d In p 
tan A = -—/ =-- 

n • Up dp 


( 4 ) 


The above two expressions remain the same for both the shock and the MC axis. 

The shock surface extends from p = 0 at the apex to ±</? ma x,shock cm the flank, with 
a symmetry of rotation around the Sun-apex line (Figure lb). For consistency, the axis 
shape is also symmetric around the apex and it extends up to p = ±<^ m ax,axis- In both 
cases, A is included in the interval [0, 90°], with A = 0 at the apex and A approaching 90° 
in the legs of the MC axis/flank of the shock front. 

Supposing that p(p) is a decreasing function of p (so that the shape p(<p) is concave 
towards the Sun, in agreement with observations and simulations [e.g., Cane, 1985; Lugaz 
et al, 2014]), Equation(4) implies that A is a monotonously increasing function of ip. It 
implies that spacecraft crossings in the range (p ± dp correspond to the unique range 
A ± dA. The conservation of the number of cases implies 


V lfi (p)dp = V(\)d\, 


( 5 ) 
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where V v ,(p)dp is the probability of spacecraft crossings in the interval p ± dip/2 and 

P(A)dA the probability in the interval A ± dA/2. 

A spacecraft located at 1 AU, over several years, will cross a large number of ICMEs 
launched from the Sun from a broad range of latitudes and longitudes with no privileged 
direction of propagation, ft implies that one can suppose a nearly uniform distribution 
of crossings. Then, the MC axis is expected to be detected with a uniform distribution 
of tp. The normalization of the total probability to unity implies V v (p) = 1/Vmax- This 
probability is different for shocks however, since they extend as a surface: the probability 
of detection in the range p ± dp is proportional to the corresponding fraction of the cross 
section of the sphere of radius D = 1 AU centered on the Sun, so V^dp oc 2t[D 2 snip dp. 
The coefficient of proportionality is again found by normalizing the total probability to 
unity. Summarizing, the probability of crossing axis/shock is: 


V^p) = 1/V max for MC axis, 
sin p 


1 — cos p r 


for shocks. 


( 6 ) 

( 7 ) 


With the above result for V^p), Equation (5) defines V{\) when dp/d\ is known, i.e. 
when the shape is known since the derivation of Equation (4) with respect to A defines 
dp/d\. All in all, V(X) writes: 


P(A) = V v (<P> 


( 8 ) 


cos 2 A(—d 2 hip/dp 2 ) ' 

This expression is common for both the MC axis and the shock, as V v (p) can then 
be replaced by its proper expression from Equation (6) or Equation (7). Since p can be 
expressed as a function of A with Equation (4) when p(p) is specified, 'P(A) in Equation (8) 
can be written as a function of A. 
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A quantitative comparison between two probability distribution functions, with one 
corresponding to the real shape V reSi \ (he. estimated from observations) and the other 
corresponding to a given synthetic model P rno d e i , can be done by computing the distance 
between the two probability curves, as: 


dist(real, mod.) 



(”Weal(A) — T'modelM) dA . 


(9) 


With a given analytical model of p(tp) for the MC/shock shape, Equation (8) provides 
a continuous function of A. However, since the observed probabilities are binned, the 
comparison requires to also bin the probability obtained from the synthetic model (the 
probability function of this binned model is then noted 'Pbmod(A)). Then, Equation (9) is 
transformed into: 


cliff (obs., mod.) = . — (^obs(Ai) - Pbmod(Ai) 

\ n b i=l 

where is the number of bins of the observed probability V 0 h s ■ 


( 10 ) 


3.2. Wood’s model 

We first start our investigation of the most appropriate synthetic model with the Wood’s 
model. In a series of papers, Wood and co-authors used images from both STEREO 
spacecraft to visually fit the ICME front and/or the leading and trailing edges of flux 
ropes with the following model: 

Pw(<p) = Pmax CXp( —]<£>/<t| Q: /2) . (11) 

The parameter er defines the azimuthal extension and a characterizes how flat the apex 
is [see Wood et a/., 2009a; Wood and Howard, 2009; Wood et al, 2010, 2011, 2012], 

With a cylindrical rotation, Equation (11) defines a surface, and Wood et al. [2009a] set 
a thin shell of density aronnc! it to model the bright front of ICMEs. Then, Wood et al. 
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computed synthetic images by simulating the Thomson scattering. The comparison with 
the observed COR and HI images of ICMEs observed by both STEREO spacecraft leads 
to the determination by visual inspection of the best values for a and a. The results are 
ICME case-dependent with a in the interval [25°,43°] and a in [2,3.3]. 

Wood et al. [2009a] also used Equation (11) to define a density shell model around a 
flux rope, as follows. They Erst defined the flux rope leading and trailing edges with 
Equation (11) and with two sets of a and a values. Then, they defined a flux-rope like 
boundary with an elliptical (or circular) cross section passing through these two edges. 
Finally, they included a density layer around the cross-section boundary. The visual fit of 
the simulated brightness images with both STEREO data determines the best parameter 
values. The resulting o values are in the interval [21°, 40°] and the a values in [2, 8]. Wood 
et al. did not explicitly model the axis of the flux rope, but since its shape is between the 
leading and trailing edge shapes, we show below the same range of a and a parameters 
for the axis as quoted above. 

In order to derive the statistical properties for the in situ data implied with the Wood’s 
model, dehned by Equation (11), we apply below the equations derived in Section 3.1. 
Then, tan A is obtained from Equation (4): 

tan A = a 2 . (12) 


For the shape to not be sharp at the apex, i.e. A = 0 for ip — 0, the condition a > 1 should 
be satisfied. Next, since tan A is a monotonously growing function of ip, the inversion of 
Equation (12) provides: 


[Pw (^) 


'2a a tan A A 1/(a 1} 


a 


(13) 
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The profile of Equation (11) has no physical meaning for large \<p>\ values, i.e. for very 
small p values (as the shape would spiral around its origin). We then set a minimum value 
P > Pmin, which implies \ip\ < <^ max and A < A max with: 


= «t(21, 1 ^) 1/ “, (14) 

Pmin 

A max = tan" 1 (a<7““^/2) . (15) 


In Figure 5 we selected p min = 0.2 AU and p max = 1 AU, which implied A max ranging from 
74° to 87° for a ranging from 2 to 8. The shape defined by this model, Equation (11), is 
shown in the left panel of Figure 5 with different colors for four values of a (= 2,4, 6, 8). 
With the same colors, the corresponding probabilities of A for the axis and the shocks, 
Equation (16) below, are shown in the central and right panels, respectively. These 
theoretical results are compared with the observed probability drawn with histograms for 
MCs analyzed by Lepping & Wu and for shocks analyzed by Wang et ah, respectively. 

The probability V{X) is computed from Equation (8), where ip is replaced with Equa¬ 
tion (13), so that the final expression is expressed as a function of A : 


(U VviVv) ■ 2-a \ -a\\ 

V W (X) = ——— -sin A cos A 

a — 1 V oi ) 


!/(«-!) 


with V v {(p w ) defined by Equation (6) or Equation (7). Since a > 1, a/(a — 1) > 0 
and V W (X) is always growing to infinity as A approaches 90° (infinite branch). If one 
restricts ip to the interval [0, <p max ], A < A max < 90° so that the singularity disappears in 
P W (A). However, V W (X) remains a sharply growing function of A for large A values, and 
this behavior starts at lower values of A for low values of a (see the curves in the middle 
and right panels of Figure 5). V W (X) is also singular at A = 0, again with an infinite 
branch, for a > 2 for the MC axis case (Figure 5, middle panel) and for a > 3 for the 
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shock case (right panel). This behavior is due to the relationship between A and p as 
shown in Figure 6 for two a values (panels) and four a values (color curves). For low 
and large values of A, the curves are flat. With a uniform distribution of p, this implies a 
larger accumulation of cases with similar A where the curve is more horizontal, so a larger 
probability 'P(A) for MC axis and shocks. 

The above properties of V W (X) imply strong differences with the observed probabilities 
of the location angle for both the MC axis and the shock normal (Figure 4). This is 
shown in Figure 5 (middle and right panels), where the observed A probabilities for both 
MC axis and shocks, deduced from lists of MC/shock events, are put in the background 
for comparison (light blue histograms). The probability curves for the synthetic model 
of Wood et al. [2009a] are computed with the parameter a in the range deduced from 
STEREO observations. A similar behavior is found for all a values, as illustrated with 
the evolution of X(p) with two values of cr in Figure 6. As a increases, p extends on a 
broader interval, while A (p) keeps a similar shape implying similar V W (X). 

As such, Wood’s model, as described by Equation (11), is too blunt around the apex, 
then too tightly curved on the flanks and again too flat in the legs/flanks to provide 
a satisfying evolution of V W (X) comparable with in situ observations. Even scanning 
the most appropriate range of a (within [2,4]), for which V W (X) is less singular, does 
not provide a probability distribution comparable to any of the observed distributions 
(Figure 4). Indeed, the differences between the model and the observations always remain 
of the order of the mean probability value (cliff rs 13 to 33 x 1CT 3 with cliff defined by 
Equation (10)). 
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We conclude that the description of the axis and the shock shapes by Equation (11) does 
not provide any probability distribution compatible with the statistical in situ results. At 
first sight, the shapes shown in the left panel of Figure 5 are not too far from the typically 
expected shape (e.g. as shown in Figure lb and as observed by STEREO). However, V(\) 
is a very sensitive function as it contains a second derivative of the shape (Equation (8)). 
It implies that, even with limited statistics, the observed probabilities V(X) provide strong 
constraints on the axis and shock shapes. 

3.3. Cosine model 

Other analytical models can be used to study the properties of the statistical distribu¬ 
tions of parameters for both shocks and MC axis. For example, a simple analytical model 
was introduced by Janvier et al. [2014a] to describe the mean shape of shocks: 

Pc{v) = PmaxCOS"(/^) , (17) 

with / = 90°/V max so that p(<p max ) = 0. This model can similarly describe the shape of 
the MC axis. In the following, we refer to this model as the cosine model, and all relevant 
parameters are denoted with a subscript “c”. The shape defined by the cosine model is 
shown in the left panel of Figure 7 for five values of the product n /, with n f the product 
between n and /, the most sensitive degree of freedom in this model. 

For an apex fixed at a given distance, e.g. 1 AU (where the Wind and ACE spacecraft 
are located), this model has only two parameters (as the Wood’s model): {n,/} or equiv¬ 
alently {n/,<p max }. Computing dlnp c /d<p and inverting Equation (4) allows to express <p c 
explicitly as a function of A: 

.,. 1 , /tan A\ . . 

<p c (A) = -tan i—yj . (18) 
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The probability V C (X) of Equation (8) is explicitly computed by including a second deriva¬ 
tion of din p c /dtp. Eliminating <p with Equation (18), the expression for V C (X) is rewritten 
as 


Pc( A) 


^ ( ^ n(l + tan 2 A) 

V ^ c) (n f) 2 + tan 2 A ' 


(19) 


With p> c expressed with Equation (18), V C (X) becomes an explicit function of A. 

The probability V C (X) is strongly dependent on the product nf as shown in Figure 7 
for the axis and the shocks in the central and right panels, respectively, with the same 
color convention than for the shapes (left panel). Values of n f > 0.5 give V C (X) functions 
that are too flat compared with the observed probabilities (as reported in the background 
for both MC axis and shock normal). This is especially true for the shocks, since the 
observed decrease of V(X) for large A is not reproduced for nf> 0.5 (e.g., the blue curve 
for nf = 0.7 is monotonously increasing). On the contrary, values nf < 0.25 imply 
that the TV (A) functions are too peaked near the origin compared with the observations 
(e.g., the black curve for nf = 0.2). Furthermore, we found the same constraints on nf 
by comparing the cosine model to the other observed probabilities (e.g. those shown in 
Figure 4). It implies that the nf values compatible with observations are located in a 
narrow interval [0.25,0.5]. This corresponds to a well-constrained shape as shown in the 
left panel of Figure 7, in between red and pink shapes, because a large modification of 
V C (X ) implies only a small deformation of the shape. 

We quantify the above results by minimizing the least square difference between the 
observation and model distributions, so minimizing the function “cliff” defined in Equa¬ 
tion (10) for each observed probability. This provides the best parameters nf and </? max . 
In fact, <£> max has a very weak effect on V C (X) [Janvier et al ., 2014a] so that only the best 
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nf value is determined. The results are summarized in Table 1. For MC axis, nf is 
almost the same, « 0.35, for the data of Lepping and Wu [2010] and Feng et al. [2010], 
while n f is slightly larger, ~ 0.5, for Lynch et al. [2005] because its V{\) has a larger tail 
for large A values (Figure 4). For shocks, all the results are clustered around nf — 0.3. 

The above results imply that the deduced mean shape of MC axis is nearly independent 
of the data set selected. This includes partly different MCs (see Section 2.4), and more 
importantly different input in the analyzed procedure from different authors [e.g. the MC 
axis depends critically on the author’s choice of the MC boundaries, Dasso et al., 2006]. 
The deduced mean shape of shocks is also very close from the one deduced from the data 
sets of Feng et al. [2010] and Wang et al. [2010] while the determined shock normals have 
significant differences for the same analyzed shock (Section 2.4). 

3.4. Ellipsoidal model 

The ellipsoidal model was introduced by Janvier et al. [2013] to describe the mean shape 
of the MC axis. Its derivation is less simple than for the above cosine model, although 
V(X) can still be derived analytically. The MC axis is described by an ellipse of half size a 
and b in the radial and orthoradial directions, respectively (Figure 8). The ellipse centre 
is at a distance d from the Sun, and we define a point M on the axis where the spacecraft 
crosses the structure, situated at a distance p e from the Sun: 

p e = \)(d + a cos 5) 2 + (6 sin <5) 2 , (20) 

where 6 is the angle defining the position of M from the ellipse centre. The azimuthal 
coordinate, (p e , writes 

tarn^e = bsm5/(d + a cos 5). (21) 
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These equations, although first defining the MC axis, can similarly describe the shock 
shape (considering a surface symmetric around the Sun-apex line) as a parametric curve 
(p e (6),(p e (6)). The maximum angular extension of the model, </? max , is defined by: 

tan y?max = b/Vd 2 — a 2 . (22) 

For an apex fixed at a given distance, this model has only two parameters similarly to the 
two previous models: a/d and b/d, or equivalently b/a and 
The location angle A is expressed with Equation (4) as: 

. a sin 5 cos <p e — b cos S sin 

tan A = —:——- - -t-. (23) 

a sm o sm p e + b cos o cos cp e 

Both <p e (d) and A(5) are monotonously growing functions of 6, then tp e is an implicit 
function of A. 

The derivation of the probability V e (X) requires several computation steps as outlined 
in Janvier et al. [2013]. The result is: 

V e (\) = V v {<p e )/\d\/d<p\, (24) 

with 

dA 1 + tan 2 5 a d + a cos 5 

dip 1 + (a/b) 2 tan 2 5 b cos (p e a sin 5 sin <p e -fi b cos S cos ip e 

With Equations 21 and 23 providing the monotonous functions (p e ( A) and 5(A), V c (\) is 
an implicit single-value function of A. 

We show in the left panel of Figure 9 the different shapes obtained with such a model for 
different aspect ratios. We also show the associated probability functions (colored curves) 
on top of histograms of the observed distributions of A for both MC axis (middle panel) 
and shock normal (right panel), with the same drawing convention than in Figure 7. For 
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both MC axis and shock normals, the probability function V e (X) matches the observed 
V(X) the best for a value b/a ~ 1.3 (green curve). This corresponds to an ellipsoidal shape 
slightly elongated in the orthoradial direction. Although the parametrised curve does not 
change much for nearby values, the probability V e (X) is very sensitive to the parameter 
b/a as there are large differences from the observed V(X) for these nearby values, e.g. 
b/a = 1.1 and 1.5 (red and pink curves respectively in Figure 9). Compared to the 
cosine model, the ellipsoidal model provides an even closer match with the observations, 
especially for large A values of the V(X) function for shock normals (e.g., one can compare 
the green curves in Figures 7 and 9). 

We find the best ellipsoidal description of the MC axis and shock front by using the 
least square difference technique (as in Section 3.3). In Figure 10, we show how to obtain 
the best fits for two different samples of MCs and shocks with a pair of plots for each 
sample. We also show three values of ip meLX (color curves). For each sample, we show in 
the left panel of each pair the evolution of the cliff value, Equation (10), as a function 
of the ellipse aspect ratio b/a. The lowest values of cliff correspond to the best fits, and 
the associated probability functions are then reported on top of the observed probability 
distribution in the right panel of each pair. 

The minimum cliff values are all similar for low 9? max values (e.g. 15° and 30°) for 
both MCs and shocks (Figure 10), while they increase for larger </9 max values (e.g. 60°). 
Indeed the probability of A is weakly dependent of </? max for low <£> max values [ Janvier 
et al, 2013, 2014a], then ICMEs with variable can be analyzed together. However, 
the probability function evolution, for large </ 2 max values, V C (X ) has a too large tail for 
A > 60° values compared with observations (as shown with the green arrows in Figure 10), 
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especially for shocks (bottom panels, right panel of each pair). As such, within the limits 
of small statistics for large A values, large values of </? max are not consistent with the 
observed probability functions, which indicates that ICME shocks are not typically so 
spatially broad. This is in agreement with the predominance of solar sources of near-Earth 
ICMEs close to central meridian [64% are within 20° of central meridian Richardson and 
Cane, 2010] as well as with a mean half angular extension of CMEs of 30° estimated from 
coronagraphic observations from CMEs launched close to the solar limb to minimize the 
projection effects [Wang et al, 2011]. 

The best b/a values are summarized in Table 1 for <p max = 30°. In all cases a lower 
minimum cliff value is found for the ellipsoidal model, confirming the conclusion drawn 
above from Figures 7 and 9 that the ellipsoidal model is a better fit than the cosine model. 
For MC axis, b/a is close to 1.3 except for the data of Lynch et al. [2005] with b/a ~ 1.1, 
indicating a slightly more bent shape. For the shock normal, the value of b/a that fits 
the best with observations is around 1.4, except for the data of Feng et al. [2010] for 
which b/a ~ 1.2, indicating also a slightly more bent shape. These variations of b/a are 
comparable to the uncertainty of b/a ~ ±0.1 for both MC axis and shock normals as 
indicated by the minimum region extension of the cliff function (Figure 10). 

More globally, a comparable shape is expected for the MC axis and the shock surface as 
the flux rope is only separated from the shock by the sheath (Figure lb). This is indeed 
what is found here with both cosine and ellipsoidal models, when their 'P(A) is fitted to 
observations (Figures 7 and 9). This is quite a remarkable result, considering that 1) 
the in situ data sets and 2) the techniques to find the MC axis and shock normal are all 
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different. These comparable results from different types of data provide a strong case for 
the shape determined by cross-validating the results. 

3.5. Robustness of the derived axis and shock shapes 

The A values deduced for the different samples of MCs and shocks, and analyzed by 
different authors, show a large dispersion (Section 2.4, Figure 2). This seems a priori 
incompatible with the results above, where similar axis and shock shapes are derived 
from these various data sets (Sections 3.3 and 3.4, Table 1). To understand these results, 
we investigate the effect of the A error level in the Appendix (Section A) for both the axis 
and the shock normal on the determined shapes. 

We investigate the implications of errors on 'Pobs(A), then on the deduced MC axis and 
shock shapes. We conclude that, while there are large fluctuations in the A estimations 
by different authors for the same events (Figure 2), the observed probability distributions 
'PobsW a re less affected by these errors because of the averaging implied when building an 
histogram. Furthermore, the deduced axis and shock shapes are even much less affected by 
these errors because they depend only on the global properties of the histograms. Indeed, 
increasing the estimated dispersion, cr c , by up to a factor three only has a weak effect on 
the deduced shapes (Figure 12, left panels, the same color being used for the same case 
on corresponding left and right panels of a panel pair). Moreover, the deconvolution of 
'Pobs(^) by a Gaussian kernel sets an upper limit to the standard deviation of the A errors, 
up to ~ 18° for MC axis and ~ 8° for shock normals. Within such limits, the deduced 
MC axis and shock shapes are not significantly influenced by the error level on A. 

The above results contrast with the results of Feng et al. [2010] who compared the 
MC axis and shock normal directions directly with each MC-shock pair. The large errors 
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shown in Figure 2 do not allow to derive meaningful results from this direct approach. 
However, with the same data set, we can perform a statistical analysis and derive the 
generic MC axis and shock shape as shown above. 

4. Summary of the main results 

In the present paper, we propose an original statistical study based on different cata¬ 
logues of flux ropes and shocks observed in situ. We compare different analytical models 
to derive and quantify the most probable generic flux rope axis and shock shape. While 
in situ data provide only local information along the spacecraft trajectory crossing a 
MC/shock, our method combines the information from large sets of events to statistically 
derive global information on their generic shape. 

Our study is based on a series of papers reporting fitted parameters associated with 
flux ropes inside magnetic clouds [Lynch et al ., 2005; Feng et al, 2010; Lepping and Wu, 
2010] as well as parameters associated with properties of shocks driven by ICMEs [Feng 
et al, 2010; Wang et al ., 2010]. While the FR fits are all made with the Lundquist model, 
and the shocks with the MHD Rankine-Hugoniot relations, differences in their application 
lead to a dispersion of the deduced parameters for the same events (Section 2.4, Figure 2). 
This dispersion is more important for the location angle A, which defines the location of 
the spacecraft crossing the interplanetary structure (see Figure lb for the definition of 
A). However, the probability distributions of observed A have comparable behavior for all 
data sets (decreasing function for the MC axis, and a Gaussian-like distribution peaking 
around A ~ 30° for shock normals, see Figure 4). 

We then compared the observed distributions with those obtained from three synthetic 
models for the MC axis and shock fronts: the Wood’s (from Wood et al. 2009b), cosine 
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[.Janvier et al ., 2014a] and ellipsoidal [ Janvier et al., 2013] models. We investigated the 
different analytical distribution functions V{\) for each model, by varying the shape of 
the MC axis/shock. We scanned the range of possible parameter values for each model 
and we determine the best model parameters by computing the difference between the 
distribution function 'P(A) with each observed distribution 'Pobs(A). 

The Wood’s model was developed to analyze imager data of CMEs obtained from 
three view points (the two STEREO and SOHO spacecraft). While this model pro¬ 
vides axis/shock shapes which at first look plausible, its derived probability function of A, 
P W (A), is incompatible with all the observed distributions, 'Pobs(A), of MC axis and shock 
for all the range of parameters derived by fitting this model to imager data (Figure 5). 
Also, this model has a too flat shape at the apex, which would indicate that most mag¬ 
netic clouds and shocks would have been crossed at a low A, which is not the case. Then, 
these differences are intrinsic to the model rather than to the specific CME cases studied 
with the imaging instruments, and cannot also be accounted as the result of an evolution 
of the shape from the inner heliosphere to 1 AU. 

By contrast to the Wood’s model, both the cosine and ellipsoidal models are able to 
reproduce the observations, 'Pobs(A), within a narrow range of parameter values (Figures 
7 and 9). Still, the ellipsoidal model provides the best fit for both the A distribution for 
MC axis and shock normals for all data sets (Section 3.4). The best ellipsoidal shape is 
obtained for an aspect ratio b/a ~ 1.2 for the MC axis and b/a ~ 1.3 for the shock normal. 
This is quite a remarkable result, first because it allows us to define the quantitative 
generic shape of the MC axis and the shock front, and second because it shows that both 
structures have a similar shape, while MC and shock data are independent. 
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Moreover, although the observed probability distributions 'Pobs(A) have some differences 
from one sample of MC/shock to another (Figure 4), the results of the ellipsoidal and 
cosine models are close for different data sets (Figure 10 and Table 1), confirming that 
the derived shapes of axis/shock is weakly dependent on the details of V 0 b s (X). Indeed, 
for any model, V(X) is a very sensitive function of the axis/shock shape as it contains a 
second derivative of the shape (Equation (8)). This implies that the global behavior of 
V(X) defines precisely the generic shape of MC axis/shock. 

We investigated the reason for such weak dependence on the MC axis/shock shapes, by 
analyzing the effect of statistical errors introduced in 'Pobs(A). We found in particular that 
different levels of errors introduced moderate modifications of the probability distributions 
while the modifications of the axis/shock shape are minor (Figure 12). Indeed, a significant 
change of the shapes would require a large modification of V 0 b s (X), as shown in Figs. 5, 
7 and 9. We conclude that the observed distributions 'Pobs(A) set a strong constraint on 
the generic axis/shock shapes. 

5. Conclusions and Implications 

We have derived the generic shape of MC axis and ICME shocks from different published 
catalogues of events computing the MC axis or shock normal from in situ data. The 
MC fits are all made with the Lundquist model, and the shocks are analyzed with the 
MHD Rankine-Hugoniot relations. Both methods have their own limitations. However, 
these catalogues have presently the largest number of studied cases compared with other 
techniques. Moreover, our statistical methods developed to deduced the shapes of these 
structures can be applied to any other data samples that provide the MC axis and/or the 
shock normals. This is a further motivation to extend other catalogues. 
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Deriving generic shapes for the MC axis and shocks has several implications. First, 
they could be used in analysis of imager data with single viewpoint or stereoscopic obser¬ 
vations, as follows. Presently the imager data of ICME require a model in order to derive 
physical properties such as the velocity and the direction of propagation. Past studies 
were derived with simple models (the CME front is either supposed to have a negligible 
extension, or to be spherical around the Sun or attached to it, or to be described by 
Wood’s model). However, the results are sensitive to the model selected [e.g. Mostl et al., 
2014], We propose the ellipsoidal model, derived from our statistical results, as an alter¬ 
native model to analyze imager data. Nonetheless, with this elliptical model, it is possible 
to generalize the equations derived by Davies et al. [2013] for a detached spherical model 
to an ellipsoidal model of the ICME front. Introducing a more elaborated model implies 
more free parameters which are not necessarily constrained by imager data. However, the 
in situ data constrain well the aspect ratio 6/a of the model to a narrow range of values, 
then they provide a reference case for application to imager data. 

A second implication of our results is for forecasting ICME arrival times at Earth. The 
arrival time depends on the solar launch time, the velocity history of the ICME front 
during the transit and also on the shape of the front for ICMEs impacting Earth away 
from their apex. This last effect can provide delays up to two days as shown by Mostl and 
Davies [2013] with a circular front. Introducing an ellipsoidal shape could be important 
away from the apex to better define the front position, but also to better determine the 
ICME trajectory from imager data (previous paragraph), so finally where the front is 
crossed. Then, the ellipsoidal model is expected to improve our forecast abilities as it 
provides a step forward in modeling the ICME front shape with constraints derived from 
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in situ data. A first application of this was performed on a very fast ICME observed by 
seven spacecraft [Mostl et al., 2015]. The deduced aspect ratio, 1.4 ± 0.4, is comparable 
to the one deduced above from in situ measurements. 

Finally, the present analysis of in situ data can be extended to other solar distances. 
This requires a consistent set of in situ measurements at different distances away from the 
Sun. How would the shapes of the MC axis and associated interplanetary shock change 
with helio-distance? Would the expansion be self-similar for both structures? In that 
sense, the future missions Solar Probe Plus and Solar Orbiter will be of great interest to 
advance our knowledge of the evolution of ICMEs in the inner heliosphere. 

Appendix A: Effects of errors on the derivation of axis and shock shapes 
Al. Estimation of the A-error distribution 

The differences between the A values obtained by different authors for the same event 
were analyzed in Section 2.4. We quantified the large dispersion of the A values obtained by 
different authors (Figure 2), and reported the distribution of the values of the A differences 
in Figure 11 with histograms. The mean of this distribution is negligible for MCs (~ 1°) 
and small for shocks (~ 7°). The standard deviation is smaller for MCs (<7 0 b s ~ 11°) than 
for shocks (<r 0 bs ~ 19°). It is remarkable that even with a limited number of common cases 
(45 MCs and 36 shocks), implying large statistical fluctuations within each histogram bin, 
both distributions are comparable to a normal (Gaussian) distribution (red curve) with 
the same mean and standard deviation. 

We analyze below the implications of A errors on 'Pobs(A), then on the deduced MC axis 
and shock shapes. To do so, we propose to first estimate the error distributions for MCs 
and shocks, and to provide a simple model of the error distribution. Then, we attempt to 
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remove these errors on T’obs(A), and finally, we re-introduce variable level of errors in the 
newly obtained cleaned distribution to study their implications on the deduced MC and 
shock shapes. 

The errors in the determination of A are spread in the observed probability distributions 
of A (Figure 4). In order to estimate the A error distribution, we decompose A as A = 
Atme + Aerror where A tm e is the true A value and A er ror the error due to the limitations of both 
observations (data only along the spacecraft trajectory) and modeling (boundary selection 
and fit of the data to a model, see Sections 2.1 and 2.2). Considering two samples A and 
B of common events, we can only access the distribution of A er ror,A — A er ror,B (Figure 11) 
and not the individual distributions of A err or,A and A er ror,B (as we do not know A tm e)- Since 
the mean values of the A eri . 0 r,A — A err or,B distributions are small (Figure 11), both for MCs 
and shocks, there are only weak systematic biases. This small bias can also be observed 
in the left panels in Figure 2, with a similar number of cases above or below the identity 
straight line (brown line), and also by comparing blue (fitted) and brown lines. Next, 
we assume that A err or,A and A errorj B are independent statistical variables with the same 
standard deviation a a = <7b- Since the variance of the difference of two independent 
variables is the sum of their two variances ( cr 2 = a \ + (x|), then <ta = <tb is a factor l/\/2 
lower than the standard deviation found in Figure 11. For MCs, this implies a a = °b ~ 8°, 
and for shocks <ta = cr& ~ 14°. Next, since the distributions of A er ror,A — A eiT or,B for MCs 
and shocks are comparable with a normal distribution (Figure 11), we suppose below that 
the error distribution of each set of observations is a normal distribution with a = (X 0 b s /\/2: 

n„(A) = ^=e- (A/ ’ )a/2 . (Al) 
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In this framework, the observed distribution, 'Pobs(A), is the result of the true distribution, 
^true(A), convoluted with V eTT (X) (called the kernel of the convolution). 

A2. Deconvolution of the observed probabilities 

The deconvolution of a signal is a delicate problem, in particular since we only have 
a crude approximation of the kernel with Equation (Al) [Press et al., 1992, chap. 13]. 
Moreover, there are boundary effects at A = 0° and 90°. In order to limit them we 
take into account the properties of 'Pobs(A). At A = 0°, 'Pobs(A) is maximum for MCs 
and close to zero for shocks. We keep these properties by imposing a symmetric (anti¬ 
symmetric) distribution for MCs (shocks), respectively. In the vicinity of A = 90°, the 
distributions are small so we fix A > 90° with zero values on an interval larger than the 
kernel used. We compare different techniques of deconvolution present in the Mathcmatica 
software (damped least square, Wiener filter and total variation). The first one provides 
the best results since the estimated 'Ptme(A) distribution has less oscillations and the back 
convolution of the results by the same kernel is closer to the original distribution. 

The results of the deconvolution with V err (X) with a standard deviation <Td, for both 
MCs and shocks studied by Feng et al. [2010], are shown in Figure 12 with the blue curves 
on the right panels. For MCs, we round Ud ~ 8° to 10°, so closer to the shock value of 
<Td = 14°. The convolution of these results by the same kernel ( a c = <Td) are shown with 
the red curves. They are a smoothed version of the original distributions. Next, the 
green and pink curves are the result by a convolution larger by 10° and 20°, respectively 
(<j c — or = 10° and 20°). The distributions 'P(A) broaden as a c is increased, as expected. 
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A3. Implications for the derived MC and shock shapes 

We find the best ellipsoidal model for each distribution, similarly to the procedure of 
Section 3.4. While the value of a c changes significantly the A distributions, its effect is 
small on the derived shapes (Figure 12, left panels). Indeed, there is almost no difference 
between the shapes derived from 'Pobs(A) (histograms) and its deconvoluted version (blue 
curves). This result could be anticipated from Figure 9 which shows that a large change 
of 'P(A) is needed to have a meaningful change of the derived shape. Indeed, even further 
convoluting 'Pobs(A) with a kernel broader by 10° or 20°, so increasing the error on A, only 
results in a slightly more bent axis. Quantitatively, for a c — <7d = 0°, 10° and 20° (red, 
green and pink curves), we obtain b/a = 1.3,1.2 and 1.1 for MCs, and b/a = 1.2,1.1 and 
1.0 for shocks, respectively. Comparable results are also found with the cosine model with 
a weak effect of the A error magnitude on the parameter nf, then on the deduced shape. 

We also explore the effect of a broader kernel for the deconvolution. Increasing <7d implies 
a deconvoluted V(X) with more oscillations. This is a classical result of the deconvolution, 
with even negative values appearing in a deconvoluted function when the deconvolution 
kernel is broader than the original function. This is already present for <Td = 14° for 
shocks in Figure 12 and the probability 'P(A) has been represented with the constraint 
V{ A) > 0 and a renormalisation of the probability to get an integral equal unity. This 
unphysical negative probability indicates that or = 14° is an over-estimation of the errors 
(said differently, with such dispersion on the estimations of A, 'Pobs(A) should be broader). 
With the above results from Figure 12, this further implies that the computed shock shape 
is weakly affected by errors on A. 
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Figure 1. Diagram of a MC and associated parameters: (a) perspective view, (b) 
side view. The MC axis is drawn in blue, its borders in black and one representative 
magnetic field line is added. The inclination angle, i, is defined by the angle made by the 
axis direction projected on the y-z plane with the y direction (in GSE coordinates). The 
maximum angular extension of the MC axis, as viewed from the Sun, is Wax,axis while 
the shock extends up to Wax,shock- The location angle A is defined by the angle between 
the radial direction from the Sun and the normal to the MC axis or the shock. 
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Figure 2. Correlations between the absolute value of the location angle A (defined in 
Figure lb) values (left panels), and correlations between the inclination angle values, i 
(right panels) obtained for the same cases in different samples of MCs and shocks. We 
show the values of |A| (noted A for conveniency) for the MCs for a better comparison 
with the shocks (for which the sign of A cannot be determined). The two first rows 
are the correlations for MCs, with the orientation parameters found by Lepping and Wu 
(2010) reported on the horizontal axis. The values found by Lynch et al. (2005, top) and 
Feng et al. (2010, middle) are reported on the vertical axis. The last row represents the 
correlation between the parameters obtained by associating the shocks studied by Feng et 
al. (2010) with those by Wang et al. (2010). The parameters c p and c s are the Pearson and 
Spearman ranking-correlation coefficients, and ”sd” is the standard deviation, in degree, 
between the two data sets (red points). The blue line is the least square fit for the data, 
gd^le^tljte fprown line represents th^j^^ti^g fu^tjgn (jff-Qgq ^a l values of A, or i, for q§ic^ a F T 
set of events). These results show a correlation both for A and i between different data 
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^axis(i) MC axis, Lepping & Wu A axis (i) MC axis, Lynch et al. A^p) MC axis, Feng et al. 



^shockW All shocks, Wang et al. Af. hock (i) MC shocks, Wang et al. Af. hock (i) MC shocks, Feng et al. 
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Figure 3. Histograms of the inclination angle i (defined in Figure la) for different 
data sets as written in the top labels. N is the total number of cases in the histogram. 
iVaxis (iVshock) is the number of MCs (shocks) in each bin, plotted versus i for the MC axis 
(top row) and the shock normal (bottom row). The 20 bins are regularly spaced between 
i = —180 and 180°. The top left and middle panels show the total number of cases in 
blue, while the overplotted histograms show cases constrained to an impact parameter \p\ 
lower than a given threshold (|p| < 0.7 in magenta and \p\ < 0.5 in orange). This way, 
the bars in blue are always the longuest ones in the histogram, as they correspond to the 
number of cases without constraints on \p\, and are covered with the magenta and light 
orange bars. The bottom left and middle panels show the distribution of i for the shocks 
reported in Wang et al. [2010], either taken all together (left) or with the constraint that 
a MC was observed following the shock (middle). These histograms all show no global 
tendency with i (within the statistical fluctuation limit). 
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N axis(A) 


MC axis, Lepping & Wu 


-A^axis(A) 


MC axis, Lynch et al. 


N axis(A) 


MC axis, Feng et al. 



AdhockW All shocks, Wang et al. 


A^shock W MC shocks, Wang et al. ^shockW 


MC shocks, Feng et al. 



Figure 4. Histograms of the location angle A (defined in Figure lb) for different data 
sets as written in the top labels. N is the total number of cases in the histogram. N ax i s 
(Ad, hock) is the number of MCs (shocks) in each bin, plotted versus A for the MC axis (top 
row) and the shock normal (bottom row). The 20 bins are regularly spaced between A = 0 
and 90°. The color convention is the same as in Figure 3. These histograms are examples 
of distributions of A used to constrain the MC axis and shock shapes. 
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Shapes of Wood's model Probabilities of Afor MCs 


Y(AU) ^axisW - a=2 



Probabilities of Afor shocks 

P shock(A) 



Figure 5. Shapes and probabilities per unit A, in degree, for the Wood’s model 
(Section 3.2). The left panel shows the MC axis or the cross-section of the shock in a 
plane including the Sun (located at the origin) and the axis/shock apex, as defined by 
Equation (11) and limited to p > p m \ n = 0.2. The colored lines are for four different 
values of the a parameter. The corresponding probabilities of A are shown in the middle 
and right panels together with, in the background, the observed probability drawn with 
histograms for MCs analyzed by Lepping & Wu (middle panel) and for shocks analyzed 
by Wang et al. (right panel). The large differences between the colored lines representing 
Wood’s model and the histograms from the observed distribution show that the model is 
not consistent with the observations. 
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A(° 


for Wood's model 

o=30° 




Figure 6. Variation of the location angle A as a function of <p for the Wood’s model 
(Section 3.2) for four values of a and two values of a. The left panel has the same 
parameter values as in Figure 5. These results are used in Section 3.2 to analyze the 
incompatibility of Wood’s model with the observed distributions of A (Figure 5). 
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Shapes of cosine model Probabilities of Afor MCs 



Probabilities of Afor shocks 

7* shock(A) 



Figure 7. Shapes and probabilities per unit A, in degree, for the cosine model (Sec¬ 
tion 3.3). The left panel shows the MC axis or the cross-section of the shock, as defined 
by Equation (17), for different values of the n f parameter. The corresponding probabili¬ 
ties of A are shown in the middle and right panels together with, in the background, the 
observed probability drawn with histograms for MCs analyzed by Lepping & Wu (middle 
panel) and for shocks analyzed by Wang et al. (right panel). The colored lines show that 
the cosine model is the most consistent with the observed probability distributions for the 
case with n f ~ 0.3. 
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Table 1. Best fitted cosine and ellipsoidal models to various data sets. 


Observations 


cos. 

mod. 

chip. 

mod. 

data set 

N case 

nf 

clifT 

b/a 

clifT 

results with MC axis 

Lepping & Wu, all 

107 

0.36 

4.5 

1.28 

4.1 

Lepping & Wu, quality l,2 b 

74 

0.36 

6.9 

1.25 

6.4 

Lynch et al. 

132 

0.50 

5.0 

1.10 

4.4 

Feng et al. , axis 

62 

0.34 

4.8 

1.29 

4.5 


results with shock normal 


Feng et al. , shock 

62 

0.40 

7.8 

1.21 

6.9 

Wang , all 

216 

0.29 

4.5 

1.39 

3.7 

Wang , ICME not detected 12 

99 

0.32 

6.0 

1.32 

5.1 

Wang , all ICME 

117 

0.26 

4.4 

1.45 

3.6 

Wang , non-flux rope ICME 

36 

0.23 

6.1 

1.53 

5.8 

Wang , MC-likc 

36 

0.23 

6.5 

1.57 

5.8 

Wang , MC 

45 

0.32 

7.8 

1.36 

7.3 


a The difference computed with Equation (10) and multiplied by 1000. 
b The quality is defined in Lepping et al. [1990] according to the x 2 value of the fit of a 
flux-rope model to data. Here the two best groups are used. 

c Shocks not followed by an ICME. Most of them are thought to be shock flanks which 
extend beyond the ICME [see Janvier et al., 2014a]. 
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Figure 8. Diagram defining the elliptical model for the flux-rope axis or shock shape. 
M is the point of interest (where the spacecraft crosses the structure). <p e is the angle of 
the cylindrical coordinates (p e ,(p e ). The location angle A is defined between the normal 
and the local radial from the Sun. The maximum angular extension <^ max is outlined by 
radial segments tangent to the ellipse. 5 is the angle used to parametrized the ellipse. 
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Shapes of ellipsoidal model Probabilities of A for MCs 




Probabilities of Afor shocks 



Figure 9. Shapes and probabilities per unit A, in degree, for the ellipsoidal model 
(Section 3.4). We added a straight sunward edge to the ellipsoidal shape (dashed line) 
to visualize the maximum extension ±<p max (defined in Figure lb). The left panel shows 
the MC axis or the cross-section of the shock, as defined by Equation (20), for different 
values of the aspect ratio b/a parameter. The corresponding probabilities of A are shown 
in the middle and right panels together with, in the background, the observed probability 
drawn with histograms for MCs analyzed by Lepping & Wu (middle panel) and for shocks 
analyzed by Wang et al. (right panel). The comparison between the colored lines and the 
histograms shows that the ellipsoidal model with b/a ~ 1.3 best fits with the observed 
data. 
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Lepping & Wu (2010) MCs Feng et al. (2010) MCs 



Wangetal. (2010) shocks Fengetal. (2010) shocks 



Figure 10. Minimisation of the difference between the observations and the ellipsoidal 
model (Section 3.4). The top row shows results for the MC axis using two different samples 
of MCs, while the bottom row shows shock normal results using two samples of shocks. 
The difference function is defined by Equation (10) and it is plotted as a function of the 
aspect ratio b/a in the left-hand plot of each pair. The results are shown for three values 
of ^max (equal to (^ m ax,axis for MCs and to <^ maXj shock for shocks as defined in Figure lb). In 
the right-hand plot of each pair, the probabilities of the best models (he. minimising cliff 
of Equation (10)) are shown together with, in the background, the observed probability 
drawn with histograms. The green arrows show how the tail increases with (p max = 60°. 
The main result is that the deduced shape, characterized by b/a minimizing the difference 
function, is almost independent of <^ max for low values (< 60°) while for larger </? ma x value 
the ellipsoidal model is further away from observations. 


DRAFT 


March 23, 2015, 12:23am 


DRAFT 





































































JANVIER ET AL.: COMPARING GENERIC MC AXIS & SHOCK SHAPES 


X - 61 


'P(dA) Lepping & Wu ; Feng et al. P(dA) Wang et al.; Feng et al. 



Figure 11. Probability distributions of the difference of A, in degree, obtained by two 


author-groups studying the same events (left: MCs, right: shocks). The correlation of 


the A values are presented in the middle and bottom left panels of Figure 2. A Gaussian 


function with the same mean and standard deviation is superposed in red. These results 


are used to estimate the error distribution of A for MCs and shocks. 
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Probabilities of Afor MCs 
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Figure 12. (Half-)Shapes and probability distributions of MC axis (top panels) and 
shock front (bottom panels) deduced from observations with various dispersions of the 
MC axis and shock normal. The observed probability (histogram, Feng et al. , 2010) is 
deconvolved with a Gaussian kernel, Equation (Al), with a standard dispersion <7d = 10° 
for MC and dd = 14° for shocks (derived from Figure 11, see Section A). The resulting 
curve is plotted as a blue curve, rather than an histogram, for superposition. The other 
curves are the results of a further convolution by a Gaussian kernel with a standard 


dispersion cr c . The red curves have a c = <7d resulting only in a smoothing of the original 
histogram. The green and pink curves have a c > <7d resulting in a broadening of the 
original histogram. The main result is that the deduced MC axis and shock shapes are 
weakly affected by the errors on A even when these errors are enhanced by a factor 3 
compared with the estimations from observations (Figure 11). 
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